# Efficient Quantized Inference

Gryphe Codex 24B Small 3.2 GGUF
Apache-2.0
A quantized version of Gryphe's Codex-24B-Small-3.2 model. Quantization improves inference efficiency across different hardware configurations.
Large Language Model English
bartowski
626
3
Qwen3 4B GGUF
Apache-2.0
Qwen3 is the latest generation of large language models in the Qwen (Tongyi Qianwen) series, offering a full suite of dense and Mixture of Experts (MoE) models. Built on large-scale training, Qwen3 delivers major advances in reasoning, instruction following, agent capabilities, and multilingual support.
Large Language Model English
prithivMLmods
829
1
Llava 1.5 13b Hf I1 GGUF
This project provides weighted/imatrix quantized versions of the llava-1.5-13b-hf model, in a range of quantization types to suit different use cases.
Image-to-Text Transformers English
mradermacher
332
1
Qwen2 VL 7B Instruct GGUF
Apache-2.0
A quantized version of the multimodal Qwen2-VL-7B-Instruct model, supporting image-text-to-text tasks at various quantization levels.
Image-to-Text English
XelotX
201
1
Eurollm 9B Instruct GGUF
Apache-2.0
EuroLLM-9B-Instruct is a multilingual instruction-following large language model supporting 40+ languages, with a particular focus on European languages.
Large Language Model Supports Multiple Languages
bartowski
901
13
Wizardlm 2 7B Abliterated GGUF
Apache-2.0
A quantized version of the abliterated WizardLM-2-7B model, created with llama.cpp from orthogonalized bfloat16 safetensors weights. Supports multi-turn dialogue.
Large Language Model
QuantFactory
139
2
Deepseek V2 Lite Chat IMat GGUF
A GGUF-quantized version of DeepSeek-V2-Lite-Chat, available in multiple quantization types and suitable for local deployment and inference.
Large Language Model
legraphista
1,413
12
Mixtral 8x7B Instruct V0.1 Offloading Demo
MIT
Mixtral is a multilingual text generation model based on a Mixture of Experts (MoE) architecture, supporting English, French, Italian, German, and Spanish.
Large Language Model Transformers Supports Multiple Languages
lavawolfiee
391
28